DOCUMENT RESUME

ED 083 345                                             UD 013 866

AUTHOR          Dusek, Jerome B.
TITLE           Teacher and Experimenter Bias Effects on Children's
                Learning and Performance.
SPONS AGENCY    Office of Education (DHEW), Washington, D.C.
PUB DATE        28 Aug 73
GRANT           OEG-2-71-0516
NOTE            39p.; Paper presented at the American Sociological
                Association annual meeting, New York, N.Y., August
                28, 1973

EDRS PRICE      MF-$0.65 HC-$3.29
DESCRIPTORS     *Academic Performance; Achievement Tests; *Bias;
                Childhood Attitudes; Classroom Environment;
                Educational Experiments; *Elementary Grades;
                Expectation; Interaction Process Analysis; *Learning;
                Longitudinal Studies; Predictor Variables; *Teacher
                Attitudes
IDENTIFIERS     SAT; Stanford Achievement Test



ABSTRACT 

Three experiments were conducted to examine the
effects of adult expectations on children's learning and performance;
one in-classroom study and two experimental studies were made in
order to investigate developmental trends in susceptibility to
expectancy effects and the relationship of induced vs. self-generated
expectancies vis-a-vis children's learning and performance. The major
experiment was a 1-1/2 year longitudinal study of teacher bias and
expectancy effects on the Stanford Achievement Test (SAT) performance
of children in two grade 2 and grade 4 classrooms. The major findings
were that: telling teachers students will do well did not alter
children's SAT performance; teacher ranking was significantly related
to SAT performance from each of the five testing periods; and, there
were no interactions with grade level. These findings were
interpreted as indicating that teachers are good predictors of
children's academic potential and do not "bias" children's education.
The finding of the grade level X experimenter sex X sex of Ss X time
study of experimenter bias was the significant triple interaction
involving grade level, bias condition, and sex of Ss; this
interaction reflected a general trend of older Ss to be more
influenced by biasing effects of experimenters than younger Ss. The
second experimental study revealed essentially the same effects for
experimenters in whom bias was induced and those who predicted
performance themselves (self-generated bias). (Author/RJ)



FILMED FROM BEST AVAILABLE COPY 






U.S. DEPARTMENT OF HEALTH,
EDUCATION & WELFARE
NATIONAL INSTITUTE OF
EDUCATION

THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM
THE PERSON OR ORGANIZATION ORIGINATING IT. POINTS OF VIEW OR
OPINIONS STATED DO NOT NECESSARILY REPRESENT OFFICIAL NATIONAL
INSTITUTE OF EDUCATION POSITION OR POLICY.



Teacher and Experimenter Bias Effects
on Children's Learning and Performance

Jerome B. Dusek

Paper presented at the 68th Annual Convention of the American Sociological
Association, New York, New York, August 28, 1973.






Teacher and Experimenter Bias Effects
on Children's Learning and Performance

Jerome B. Dusek

Syracuse University

Abstract

Three experiments were conducted in order to examine the effects of
adult expectations on children's learning and performance. One in-classroom
study and two experimental studies were conducted in order to investigate
developmental trends in susceptibility to expectancy effects and the
relationship of induced vs. self-generated expectancies vis-a-vis children's
learning and performance. The major experiment was a 1-1/2 year longitudinal
study of teacher-bias and teacher-expectancy effects on the Stanford
Achievement Test (SAT) performance of children in two second- and two
fourth-grade classrooms. The major findings were: (a) telling teachers
students will perform well did not alter children's SAT performance;
(b) teacher ranking was significantly related to SAT performance from each
of the five testing periods; (c) there were no interactions with grade level.
These findings were interpreted as indicating that teachers are good
predictors of children's academic potential but do not "bias" children's
education. The major finding of the 3 (Grade Level) x 2 (Sex of Experimenter)
x 2 (Sex of Subject) x 7 (Minutes) study of experimenter-bias in a simple
motor performance task (marble dropping) was the significant triple
interaction involving Grade Level (1st, 3rd, 5th), Bias Condition, and Sex of
Subject. This interaction reflected a general trend for older subjects to be
more influenced by biasing effects of experimenters than younger subjects.
The second experimental study, a 2 (Induction Condition) x 2 (Bias Condition)
x 2 (Sex of Subject) x 7 (Minutes) design, revealed essentially the same
effects for experimenters in whom the bias was induced and those who
predicted performance themselves (self-generated bias).






The central problem under investigation was the effect of adult/teacher
expectations on children's learning and performance. Three studies were
conducted to provide information relevant to the following three questions:
a) Are teacher-bias or teacher-expectancy effects observable in measures of
academic performance? b) Are these effects observable only when induced in the
teacher or experimenter by the principal investigator as opposed to being
self-generated by the adult? c) Are there developmental trends in
susceptibility to adult expectancy effects?

Research bearing on these issues falls into three categories. First, there
is a body of research dealing with experimenter bias effects in psychological
research. This research has been thoroughly reviewed by Rosenthal (1965, 1966,
1969a, b), Friedman (1967) and Barber and Silver (1968a, 1968b). The literature
in this area is a clear demonstration that under certain conditions
experimenters may intentionally or unintentionally bias the performance of
adults (Rosenthal, 1966; Barber & Silver, 1968a, 1968b) or children (Dusek,
1971, 1972) in psychological experiments. Second, there are several studies in
which expectancy effects and self-fulfilling prophecies have been investigated
in tutoring situations involving student teachers (e.g., Beez, 1968; Rubovits &
Maehr, 1972). Third, there are a number of studies in which teacher expectancy
effects in elementary school classrooms, or other classroom situations, have
been investigated (e.g., Rosenthal & Jacobson, 1968; Claiborn, 1969).

In the remainder of this paper the term "teacher-bias or experimenter-bias
effects" will refer to significant effects due to teacher/experimenter
differential expectations for children's performance, but only in the case
involving induction of expectancies by a principal investigator. That is, bias
effects will be due to a manipulation, or attempted manipulation, of
expectancies by an investigator. Such effects are analogous to the effects
reported by Rosenthal (1966) and Rosenthal and Jacobson (1968) and are bias in
the sense that the adult has differential expectations regarding the
performance of children who are equivalent on some objective measure. The term
"expectancy effects" will refer to significant effects due to the adults' own,
self-generated expectations regarding children's performance. In this case, it
is the adults' own expectancy, formed however adults form it, which is related
to children's performance. This distinction will prove critical in
interpreting the findings reported below.

The first study of teacher bias effects was conducted by Rosenthal and
Jacobson (1968) in an elementary school serving primarily a lower social
class neighborhood. At the beginning of the school year all the children
in grades 1-6 were given an IQ test, Flanagan's (1960) Test of General
Ability (TOGA), disguised as a test to predict "academic blooming". The
test was given again at the middle and end of the school year. Within each
of the 18 classrooms approximately 20% of the children were randomly chosen to
form an experimental group. The names of these students were given to their
teachers and it was explained that these children had scored on the test in
such a manner as to predict that they would show large gains in intellectual
ability during the school year. Across all classrooms the year-end test
scores showed an approximately 4 point advantage for the children in the
experimental group. However, at the first- and second-grade levels the children
in the experimental group showed gains of as much as 15 IQ points more than the
children in the control group. In terms of school performance, the children
in the experimental group showed a significantly better gain than the children
in the control group only for reading, one of the 11 school grades considered.
On the basis of these data Rosenthal and Jacobson (1968) concluded that the
children in the experimental group gained more than children in the control
group during the course of the academic year because the teachers expected
a higher level of performance from them.

R. L. Thorndike (1968, 1969) has criticized the Rosenthal and
Jacobson research on several grounds, including faulty pre- and post-test
data and the suggestion that students may not have attempted a large
number of items, thus lowering their IQ scores and, essentially, making
the test a poor measure. Jensen (1969) has attacked the Rosenthal and
Jacobson research on three grounds: a) the same IQ test was used for the
pre- and post-tests; b) the teachers administered the tests; c) the child
was the unit of analysis instead of the classroom. In addition, Claiborn
(1969) has argued that many of the findings are unconvincing since they did
not reach standard levels of significance and were not predicted prior to
the investigation. Rosenthal (1968; 1969b; 1973) has convincingly replied
to many of these criticisms. However, other criticisms of the Rosenthal and
Jacobson research remain (e.g., see Elashoff & Snow, 1970, 1971; Rosenthal &
Rubin, 1971). As Snow (1969) has argued, "Rosenthal and Jacobson will have
made an important contribution if their work prompts others to do sound
research in this area. But their study has not come close to providing
adequate demonstration of the phenomenon or understanding of its process." At
the present time, no other conclusion regarding this research seems reasonable
or possible. The Rosenthal and Jacobson research has stimulated a number of
studies exploring various aspects of teacher bias, or self-fulfilling prophecy,
effects, however. It is these which shall be reviewed next.

Claiborn (1969) attempted to replicate the Rosenthal and Jacobson research.
Not only were some of the teachers led to believe certain students would show
much progress intellectually during the remainder of the year, but some of the
classrooms were observed in order to obtain data concerning the student-
teacher interactions. The results indicated no differential gain in IQ
between the experimental and control children. Furthermore, there was
no indication that teachers behaved differently toward the control and
experimental children. Since the biasing statements were introduced well
into the school year (Spring semester) and since the length of the study
was only 2 months, the results are difficult to interpret. Perhaps the
teachers had their own well-formed opinions of the students' potential
and the opinion of an "outsider" was just not seen as valid. Perhaps, too,
the two month interval between biasing and post-testing was not long enough
for the effect of teacher expectancy to become critical to the students'
performance.

Several other studies have attempted to replicate the findings of
Rosenthal and Jacobson with varying degrees of success. Evans and Rosenthal
(1969) found that for kindergarten through fifth grades the boys in the
experimental group gained more IQ points in the reasoning subtest than the boys
in the control group, with the reverse holding for the girls. There were no
effects for Verbal IQ or Total IQ scores. Anderson and Rosenthal (1968)
report a failure to replicate with familial retarded boys. Meichenbaum, Bowers,
and Ross (1969), using female adolescent offenders, reported that the "bloomers"
showed more improvement on objective but not on subjective tests than did the
control group. This study is of particular interest since it focused on
academic performance rather than IQ. Furthermore the classroom observations
revealed that the "bloomers" significantly improved in terms of appropriate
behavior more than did the control group.




Experiment I

A Longitudinal Study of Teacher-bias and Teacher-expectancy Effects
on Elementary School Children's Achievement Test Performance

The purpose of this experiment was to investigate teacher-bias and
teacher-expectancy effects on elementary school children's achievement test
performance. Teacher-bias is defined as above, that is, an expectancy for
performance as induced by the principal investigator. Analogously, teacher-
expectancy is defined as the teacher's own self-generated (stated) expectations
regarding children's performance. In this experiment, as will be noted below,
teacher-bias was manipulated by statements from the principal investigator and
teacher-expectancies were measured by teachers' rankings regarding year-end
academic performance levels.
Subjects

The subjects were 32 second-graders (CA = 8.60 years), 13 boys and 19
girls, and 32 fourth-graders (CA = 10.73 years), 15 boys and 17 girls, attending
a school serving primarily a lower-class population. There were 16 subjects
in each of two classrooms in each grade level.
Procedure

During the first week of the 1971-1972 academic year several subtests
from the SAT battery were administered by the principal investigator to each of
the classrooms involved in the study. The subtests administered included: Word
Reading, Paragraph Meaning, Spelling, Arithmetic Computation, and Arithmetic
Concepts (fourth-grade only). The Primary I and Partial Intermediate I batteries
were used for the second- and fourth-grades, respectively. The SAT's were
disguised as tests to measure potential gains in language and arithmetic skills.
The same subtests were administered at the middle and end of the 1971-1972
academic year. The SAT's were again administered at the beginning and
middle of the 1972-1973 academic year, the children now being in the third-
and fifth-grades. Subtests from the Primary II and Partial Intermediate I
batteries were now employed for the third- and fifth-graders respectively. It
is important to keep in mind that the subjects were, at this time, in new grade
levels with new teachers.

During the initial testing session each teacher was asked to rank the
children in her classroom from 1-N based on her expectations regarding their
year-end performance levels in language and arithmetic skills. In each
classroom the children ranked 1-16 were randomly and equally divided into an
experimental and a control group. One week after the initial testing each
teacher was given the names of the children in the experimental group and was
told that, on the basis of the tests, these children should show large gains
in language and arithmetic skills during the academic year. It should be noted
that no further mention of these children was made to any teacher throughout the
remainder of the study, a year and a half.
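
The random, equal division of ranked children can be illustrated with a short sketch. It assumes 16 ranked children per classroom and an unrestricted shuffle; the paper does not say whether the split was stratified by rank, and the function name and seed are hypothetical.

```python
import random

def split_classroom(ranks, seed=None):
    """Randomly divide ranked children into equal experimental and control groups."""
    rng = random.Random(seed)
    shuffled = list(ranks)
    rng.shuffle(shuffled)              # unrestricted shuffle (an assumption)
    half = len(shuffled) // 2
    return sorted(shuffled[:half]), sorted(shuffled[half:])

# Teacher rankings 1-16 for one hypothetical classroom.
experimental, control = split_classroom(range(1, 17), seed=1973)
```

Because assignment ignores the teacher's ranking, the two groups should be equivalent in expected rank, which is what lets any later difference be attributed to the biasing statement.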
Results

The dependent variables were total SAT raw scores for each testing session.
Originally, the design was conceived as a three-way factorial arrangement,
including experimental vs. control groups, grade level, and teacher ranking.
However, due to subject attrition there was not an equal number of subjects
in each cell. Rather than solve the analysis problem by application of the
unweighted means solution to the analysis of variance, the multiple regression
approach of Cohen (1968), Overall and Spiegel (1969), and Overall (1972) was
employed.
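
A minimal sketch of the regression approach to an unbalanced design follows. It is not the author's analysis: the data are simulated, the coding (0/1 dummies for condition and grade, ranking as a continuous predictor) is an assumption, and each effect is tested by comparing the full model with a model that omits that predictor.

```python
import numpy as np

def sse(X, y):
    """Residual sum of squares from an ordinary least-squares fit of y on X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)

def partial_f(X_full, y, drop_col):
    """F ratio (1 numerator df) for one predictor, adjusted for all the others."""
    n, p = X_full.shape
    X_reduced = np.delete(X_full, drop_col, axis=1)
    sse_full = sse(X_full, y)
    sse_reduced = sse(X_reduced, y)
    return (sse_reduced - sse_full) / (sse_full / (n - p))

# Simulated data: 51 children in unequal cells (not the study's scores).
rng = np.random.default_rng(0)
n = 51
condition = rng.integers(0, 2, n)    # 0 = control, 1 = experimental
grade = rng.integers(0, 2, n)        # 0 = second grade, 1 = fourth grade
rank = rng.integers(1, 17, n)        # teacher ranking, 1 = highest expectation
score = 100 - 3 * rank + rng.normal(0, 5, n)   # score depends on rank only

X = np.column_stack([np.ones(n), condition, grade, rank])
f_condition = partial_f(X, score, drop_col=1)
f_rank = partial_f(X, score, drop_col=3)
print(f"F(condition) = {f_condition:.2f}, F(rank) = {f_rank:.2f}")
```

Because every effect is adjusted for the others, unequal cell sizes do not require discarding subjects or averaging cell means, which is the practical advantage over the unweighted means solution.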




The results of the multiple regression analyses are summarized in
Table 1. The means associated with the main effects of the multiple
regression analyses are presented in Table 2. As may be seen in Table 1
the bias manipulation (Experimental Condition) was not significantly
related to SAT performance on any of the five testing occasions. Grade
Level was significantly related to performance on SAT-2, SAT-3, and SAT-4.
As may be seen in Table 2, the younger Ss scored higher than the older
Ss on SAT-2 and SAT-3 with the reverse being the case for SAT-4.

Teacher ranking was strongly and consistently related to SAT performance
on each testing occasion. In general, the higher the teacher's ranking the
higher the child's SAT performance (see Table 2). The correlations between
SAT performance and Teacher Ranking, presented in Table 3, reflect the strength
of the relationships detected in the multiple regression analyses.
Conclusions

The findings are quite conclusive with respect to the importance of
teacher-bias and teacher-expectancy effects on children's academic
performance. Clearly, simply telling teachers certain students would be
performing well at the end of the academic year was not sufficient to increase
these students' SAT performance. It appears that the teachers biased neither
the SAT performance nor the classroom learning of the children in the
experimental or control groups. This appears to be the case for both short- and
long-term effects due to teacher-bias. This finding does not replicate the
findings of Rosenthal and Jacobson (1968). When considered in conjunction with
other research (e.g., Claiborn, 1969; Anderson & Rosenthal, 1969; Evans &
Rosenthal, 1969; Fleming & Anttonen, 1971; Jose & Cody, 1971) which has also
failed to replicate the Rosenthal and Jacobson findings, however, it seems quite
clear that teachers do not bias students' performance.




Table 1

[Table 1, which summarizes the multiple regression analyses, is not legible in
the available copy.]






Table 2

Mean Stanford Achievement Test Scores for
Each Condition at Each Testing Session

                                    Mean Test Score
Condition              SAT-1     SAT-2     SAT-3     SAT-4         SAT-5

Condition
  Experimental         58.35     79.15     96.30     75.53         84.[illegible]
  Control              56.63     76.51     96.71     [illegible]   81.11

Grade Level
  Second (Third)       57.59     85.63    111.38     65.09         84.91
  Fourth (Fifth)       57.38     70.37     79.95     81.63         79.56

Teacher ranking
  Rankings 1-4         75.56     96.55    116.13    101.75        116.50
  Rankings 5-8         60.13     81.48    100.82     67.13         36.50
  Rankings 9-12        52.50     73.21     89.08     76.36         77.82
  Rankings 13-16       41.75     60.06     80.00     49.73         60.09

The grade level listed in the parentheses refers to SAT-4 and SAT-5.

Teacher ranking was entered as a continuous variable in the multiple regression
analyses. The data are grouped here simply for convenience.






Table 3

Correlations Between Teacher Ranking and SAT Performance
Across All Grade Levels and Conditions

SAT        r

1-5        -.59, -.67, -.55  [only three of the five coefficients are legible
                              in this copy; their row assignment is uncertain]

Note. - n = 51 for SAT-1, SAT-2, and SAT-3 and n = 38 for SAT-4 and
SAT-5. All r's are statistically significant (p < .001).
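
The significance claim in the note to Table 3 can be checked with the usual t transformation of a correlation. The sketch below pairs the weakest coefficient shown in Table 3 (-.55) with each of the two sample sizes; that pairing is an assumption made for a conservative check, since the table does not show which coefficient belongs to which testing occasion.

```python
import math

def t_from_r(r, n):
    """t statistic (df = n - 2) for testing that a Pearson correlation is zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)

t_small_n = t_from_r(-0.55, 38)   # weakest r with the smaller sample (df = 36)
t_large_n = t_from_r(-0.55, 51)   # weakest r with the larger sample (df = 49)
```

Two-tailed .001 critical values of t for 36 to 49 degrees of freedom are roughly 3.5 to 3.6, so even this least favorable pairing is consistent with the note. The correlations are negative because a ranking of 1 denotes the highest expected performance.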




Teacher Ranking was related to SAT performance on each testing occasion.
Children ranked higher by the teacher had higher SAT scores than children ranked
lower. This effect has been deemed a teacher expectancy effect since it reflects
the teacher's own self-generated expectancy for the child's performance.

There is some evidence in the present study which supports the argument that
this teacher-expectancy effect is not a teacher-bias effect in the Rosenthal and
Jacobson (1968) sense. The first piece of evidence is the correlation between
Teacher Ranking and SAT-1 performance. If this teacher-expectancy effect were due
to teachers somehow biasing the test performance of the children it is unlikely that
the magnitude of the correlation would have been as large. Second, teacher ranking
was related to SAT performance 12 and 18 months after the ranking was made, the
students now being advanced one grade level and under the tutelage of a new teacher.
It is unlikely that this could be the case were the relationship based on a biasing
influence by the teacher on the students' performance. Finally, the teachers reported
that their rankings were based on criteria directly relevant to academic abilities,
e.g., previous grades, readiness tests, and current classroom performance.

These effects due to teacher-expectancy appear to reflect the teacher's ability
to accurately estimate the relative academic ability of the children in her classroom.
The longitudinal data presented above appear to support this contention. Future
research should focus on determining the exact bases used by teachers to form
expectancies regarding students' abilities and the relationship of these bases to actual
student performance as well as to teacher-student interaction in the classroom. Such
research will clarify not only the nature of teacher-expectancies but also the role of
the teacher in the child's cognitive and social development.

In order to gain a relatively complete understanding of children's reactions to
adult expectations two studies aimed at examining experimenter-bias effects were also
undertaken. Since this literature has been reviewed in several readily available
sources (e.g., Rosenthal, 1966; Barber & Silver, 1968a, 1968b), and since the
area is somewhat tangential to the major purpose of this symposium, the procedures
and data will be presented in an abbreviated form. A more detailed report is
available from the author.

Experiment II

A Developmental Study of Experimenter Bias
Effects with Children as Subjects

Although E-bias effects have been shown in studies using children as Ss
(e.g., Dusek, 1971, 1972), no information regarding developmental trends in
susceptibility to E-bias effects is available. The major purpose of this experiment
was to test for possible developmental trends.

Subjects

The subjects were 48 first- (CA = 7 yrs. 4 mo., SD = 9 mo.), 48 third- (CA = 9 yrs.
5 mo., SD = 7 mo.), and 48 fifth-graders (CA = 11 yrs. 6 mo., SD = 8 mo.). Half the
children in each grade level were males and half were females. The children attended
a school serving primarily a lower-class neighborhood.
Experimenters

The experimenters were six male and six female college students (CA = 19 yrs.
11 mo., SD = 7 mo.) enrolled in the introductory psychology course at Syracuse
University. Each E participated in both a group and an individual training session
prior to testing the children (see below). During the experiment each E tested two
boys and two girls from each grade level. Half the experimenters of each sex were
randomly assigned to each bias condition.
Apparatus

The apparatus has been described in detail elsewhere (Stevenson & Fahel, 1961).
Briefly, it consisted of a table with two bins and a transverse upright panel which
served as a shield. The left bin contained approximately 1000 marbles. The table
top above the right bin contained five randomly placed holes through which the marbles
could be dropped. An Esterline Angus Event Recorder, shielded from S's view,
was connected to microswitches below the holes and was used to obtain an
automatic and permanent record of S's responses. The experiment was conducted
in an area of the school free of distractions.
Procedure

Experimenter training. Each experimenter was randomly assigned to one of
the two bias conditions. All experimenters (n = 6) in the same bias condition
attended the same group training session. The experimenters were shown the
apparatus and the procedure was briefly outlined and demonstrated. The
experimenters were then told they would be testing children in the public schools
and the following biasing statement was made:

     We have used this task with this age children before and it
     has been found to be a sensitive measure of children's
     motivation. In fact, previous research shows that one of the
     findings we should expect to get is that boys (girls) will drop
     the marbles faster than girls (boys).

The procedures were then demonstrated again and each experimenter practiced
the task. Each experimenter subsequently met with a graduate assistant to
further practice the procedures.

Experimental task. The experimenter brought the subject to the testing room
and read the instructions telling the subject to pick the marbles up one at a time
and put them into the holes. As the subject picked up the first marble the
experimenter started a stop watch and allowed the child to perform at the task for
seven minutes.

During the first or baseline minute of the task the experimenter remained an
attentive but nonresponsive observer of the subject's performance by glancing at the
marbles and holes while avoiding looking at the subject. During the next six minutes,
the experimental period, the experimenter used verbal reinforcers on a Fixed Interval
30-second schedule contingent on a marble drop. Six reinforcing statements were used:
Good, Fine, That's good, That's fine, Very good, Very fine. Each subject received
each statement twice in a predetermined random order. Each experimenter
tested two boys and two girls at each grade level using this procedure.
Design

The experimental procedures required a 3 (Grade Level) x 2 (Sex of E)
x 2 (Bias Condition) x 2 (Sex of S) x 7 (Minutes) analysis of variance design
with six subjects in the smallest cell.
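
The cell size follows from the design and the sample of 144 children. The sketch below enumerates the between-subject cells; the factor labels are illustrative names, and Minutes is left out because it is a repeated measure on the same children and so adds no cells.

```python
from itertools import product

# Between-subject factors of Experiment II (labels are illustrative).
factors = {
    "grade": ["first", "third", "fifth"],
    "sex_of_experimenter": ["male", "female"],
    "bias_condition": ["to males", "to females"],
    "sex_of_subject": ["male", "female"],
}
cells = list(product(*factors.values()))
subjects_per_cell = 144 // len(cells)   # 48 children in each of three grades
```

With 24 cells and 144 children the design has six subjects per cell, and the error term of a between-subject analysis has 144 - 24 = 120 degrees of freedom.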
Results 

Dependent measures. There were two dependent variables of interest in the
study: the base rate of response (the number of marbles dropped in the first
minute of the task) and a series of difference scores computed separately for each
subject by subtracting the number of marbles dropped in the first minute from the
number of marbles dropped in each subsequent minute (Minutes 2-7). The correlation
between the base-rate score and the average difference score was -.4133 (n = 144,
p < .01), indicating that although the two variables are correlated only 17.1% of the
variance in the difference scores is accounted for by the initial base rates.
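
The two dependent measures and the shared-variance figure can be illustrated as follows. The marble counts are hypothetical; only the correlation of -.4133 comes from the text.

```python
def difference_scores(marbles_per_minute):
    """Subtract the baseline (minute 1) count from each later minute's count."""
    base = marbles_per_minute[0]
    return [count - base for count in marbles_per_minute[1:]]

# Hypothetical counts for one child over the seven minutes of the task.
example = [22, 23, 25, 24, 26, 25, 27]
diffs = difference_scores(example)        # six scores, Minutes 2-7

r = -0.4133                               # correlation reported in the text
shared_variance = r ** 2                  # proportion of variance accounted for
```

Squaring the correlation gives about .171, the 17.1% quoted above.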

Analysis of base-rate scores. The base-rate scores were subjected to a
3 (Grade Level) x 2 (Sex of Experimenter) x 2 (Bias Condition) x 2 (Sex of Subject)
analysis of variance (see Table 4). The mean base-rate scores for each main
effect are presented in Table 5.

As may be seen in Table 4, there were two significant effects. The significant
Grade Level effect reflected a general increase in base rates with increasing grade
levels. Newman-Keuls comparisons (Winer, 1962, p. 80) revealed that the means for
each grade level were significantly different from each other (all p < .01). The
Bias Condition x Sex of Subject interaction was also significant. The means are
presented in Table 6. Individual comparisons (Winer, 1962, p. 207ff) revealed a






Table 4

Analysis of Variance of Base-Rate Scores

Source                  df         MS          F

Grade Level (A)          2      604.000      23.16
Sex of E (B)             1        4.000       <1
Bias Condition (C)       1      103.313       3.96
Sex of S (D)             1         .438       <1
A x B                    2       14.656       <1
A x C                    2       56.188       2.16
A x D                    2       14.094       <1
B x C                    1        1.813       <1
B x D                    1        8.000       <1
C x D                    1      277.813      10.65
A x B x C                2       31.375       1.20
A x B x D                2        3.063       <1
A x C x D                2       68.375       2.62
B x C x D                1       84.000       3.22
A x B x C x D            2       21.750       <1
error                  120       26.078
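
Each F ratio in Table 4 is the mean square for the effect divided by the error mean square. A short check on three of the rows, using the values printed in the table:

```python
MS_ERROR = 26.078  # error mean square from Table 4 (df = 120)

mean_squares = {
    "Grade Level": 604.000,
    "Bias Condition": 103.313,
    "Bias Condition x Sex of S": 277.813,
}
# F = MS(effect) / MS(error), rounded as in the table.
f_ratios = {name: round(ms / MS_ERROR, 2) for name, ms in mean_squares.items()}
```

The recomputed ratios agree with the tabled values of 23.16, 3.96, and 10.65.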








Table 5

Mean Base-Rate and Mean Difference Score
for Each Main Effect

Effect                 Mean            Mean
                       Base-Rate       Difference Score

Grade Level
  First                20.69            -.44
  Third                23.46            1.59
  Fifth                27.73            1.59

Sex of E
  Male                 23.79             .95
  Female               24.12             .88

Bias Condition
  To Males             23.11            1.06
  To Females           24.81             .76

Sex of S
  Male                 24.01             .37
  Female               23.90            1.46






Table 6

Mean Base-Rate Scores for the Bias Condition
x Sex of Subject Interaction

Sex of                    Bias Condition
Subject              To Males      To Females

Males                 21.78          26.25
Females               24.44          23.36
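
With equal numbers of children in each cell, the main-effect means in Table 5 are simple averages of the cell means in Table 6. The sketch below shows that relationship; the dictionary layout and function name are illustrative.

```python
# Cell means from Table 6: keys are (sex of subject, bias condition).
cell_means = {
    ("male", "to males"): 21.78, ("male", "to females"): 26.25,
    ("female", "to males"): 24.44, ("female", "to females"): 23.36,
}

def marginal(level, position):
    """Average the cell means that share one level of one factor."""
    values = [m for key, m in cell_means.items() if key[position] == level]
    return sum(values) / len(values)

male_subjects = marginal("male", 0)        # compare with 24.01 in Table 5
bias_to_males = marginal("to males", 1)    # compare with 23.11 in Table 5
```

The averages match the Table 5 entries to within rounding, which also serves as a check on the legibility of both tables.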






significant Bias Condition effect only for the boys (F = 13.79, df = 1/120,
p < .001), but significant Sex of Subject effects for both Bias to Males
(F = 4.88, df = 1/120, p < .05) and Bias to Females (F = 5.76, df = 1/120, p < .05).
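
These comparisons can be reproduced from the Table 6 cell means. The sketch assumes each comparison contrasts two cell means of 36 children each (144 children in four cells) and uses the pooled error mean square from Table 4; the paper does not spell out the formula, so this is a reconstruction that happens to match the reported values.

```python
MS_ERROR = 26.078   # error mean square from Table 4
N_PER_CELL = 36     # 144 subjects / 4 Bias Condition x Sex of Subject cells

def simple_effect_f(mean_a, mean_b, n=N_PER_CELL, ms_error=MS_ERROR):
    """F (df = 1/120) for the difference between two cell means of equal n."""
    return n * (mean_a - mean_b) ** 2 / (2 * ms_error)

f_bias_for_boys = simple_effect_f(21.78, 26.25)        # reported as 13.79
f_sex_bias_to_males = simple_effect_f(21.78, 24.44)    # reported as 4.88
f_sex_bias_to_females = simple_effect_f(26.25, 23.36)  # reported as 5.76
```

The numerator is the sum of squares for a two-group contrast, n(M1 - M2)^2 / 2, which has one degree of freedom.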

Analysis of Difference Scores. The difference scores were subjected to a
3 (Grade Level) x 2 (Sex of Experimenter) x 2 (Bias Condition) x 2 (Sex of Subject)
x 6 (Minutes) analysis of variance (see Table 7). The means for the main effects
are presented in Table 5. The significant Grade Level effect reflected higher
difference scores for the third- and fifth-graders than for the first-graders
(see Table 5). Female Ss had higher difference scores than male Ss (see Table 5).
These effects are of limited interest, however, in view of the significant
Grade Level x Bias Condition x Sex of Subject interaction (see Table 8). Individual
comparisons (Winer, 1962, p. 344) were conducted on the bias condition x sex of
subject means separately for each grade level and tests of simple effects (Kirk, 1968,
p. 289ff) were conducted on the grade level x sex of subject means for each bias
condition. The individual comparisons revealed no significant Bias Condition or
Sex of Subject effects at the first-grade level. At the third-grade level there was
a significant Bias Condition effect (F = 12.20, df = 1/120, p < .001) for the males but
not for the females. There were significant sex differences for both the Bias to Male
(F = 7.36, df = 1/120, p < .01) and Bias to Female (F = 4.22, df = 1/120, p < .05)
conditions. At the fifth-grade level the bias conditions were significantly different
for both the male (F = 7.72, df = 1/120, p < .01) and female Ss (F = 16.11, df = 1/120,
p < .001). The tests of simple effects revealed that for the Bias toward Males condition
there was a significant age effect (F = 10.43, df = 2/120, p < .001) for the male Ss
but not for the female Ss. In the Bias toward Females condition the age effect was




Table 7

Analysis of Variance of Difference Scores

Source                      df         MS         F        p

Grade Level (A)              2      394.066      7.22     <.001
Sex of E (B)                 1        1.260      <1
Bias Condition (C)           1       19.260      <1
Sex of S (D)                 1      254.584      4.67     <.05
A x B                        2       19.448      <1
A x C                        2       81.816      1.50
A x D                        2      225.876      4.14     <.05
B x C                        1       13.751      <1
B x D                        1       10.446      <1
C x D                        1     1641.760     30.10     <.001
A x B x C                    2        6.689      <1
A x B x D                    2       78.300      1.43
A x C x D                    2      166.774      3.06     <.06
B x C x D                    1       19.862      <1
A x B x C x D                2        4.689      <1
error                      120       54.546

Minutes (E)                  5         .874      <1
A x E                       10       12.205      2.48     <.01
B x E                        5        8.841      1.80
C x E                        5        3.074      <1
D x E                        5        3.493      <1
A x B x E                   10        6.853      1.40
A x C x E                   10        6.297      1.28
A x D x E                   10        5.651      1.94
B x C x E                    5        4.660      <1
B x D x E                    5        2.659      <1
C x D x E                    5       11.008      2.24     <.06
A x B x C x E               10        3.480      <1
A x B x D x E               10        4.914      1.00
A x C x D x E               10        3.996      <1
B x C x D x E                5        8.826      1.80
A x B x C x D x E           10        1.891      <1
error                      600        4.911










Table 8

Mean Difference Scores for the Grade Level
x Bias Condition x Sex of Subject Interaction

Bias           Sex of               Grade Level
Condition      Subject        First     Third     Fifth

To Males       Male            -.03      3.94      1.78
               Female          -.56       .60       .64

To Females     Male           -1.47      -.36     -1.64
               Female           .31      2.17      5.58



Table 9

Mean Difference Scores for the Grade
Level x Minutes Interaction

                               Minutes
Grade Level      2       3       4       5       6       7

First           .12    -.21    -.75    -.46    -.19   -1.15
Third          1.02    1.79    1.33    1.83    1.88    1.67
Fifth          1.62    1.27    1.71    1.56    1.02    2.35







(F = 18.86, df = 2/120, p < .001). In effect, both the individual comparisons
and the tests of simple effects reveal grade level differences and sex differences
in susceptibility to subtle cues emitted by the experimenters.

There were two within-subjects comparisons which were significant. The
means for the Grade Level x Minutes interaction may be seen in Table 9. In
general, the performance rates of the first-graders declined over time but the
performance rates of the third- and fifth-graders increased over time and then
remained relatively stable.
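As a consistency check on Table 9, averaging each grade's six minute-by-minute means should recover the grade-level difference scores in Table 5, and the change from minute 2 to minute 7 summarizes the trend just described. This is a minimal sketch; the first-grade entry for minute 2 is read as .12, which is an assumption because the scan is ambiguous at that cell.

```python
# Sketch: row means and first-to-last change for the Grade Level x Minutes
# means in Table 9 (minutes 2 through 7).
# Assumption: the first-grade, minute-2 value is .12 (the scan is unclear).

table_9 = {
    "First": [0.12, -0.21, -0.75, -0.46, -0.19, -1.15],
    "Third": [1.02, 1.79, 1.33, 1.83, 1.88, 1.67],
    "Fifth": [1.62, 1.27, 1.71, 1.56, 1.02, 2.35],
}

# Mean difference score per grade, averaged over the six minutes
grade_means = {grade: sum(row) / len(row) for grade, row in table_9.items()}

# Change in mean difference score from minute 2 to minute 7
change = {grade: row[-1] - row[0] for grade, row in table_9.items()}

for grade in table_9:
    print(grade, round(grade_means[grade], 2), round(change[grade], 2))
```

The row means round to -.44, 1.59, and 1.59, in agreement with the Grade Level entries of Table 5, and only the first-graders show a negative change across the period.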

The Bias Condition x Sex of Subject x Minutes interaction approached the
traditional p < .05 level of significance. The performance curves reflected by
this effect are shown in Figure 1. Individual comparisons (Winer, 1962) of each
pair of means for each minute revealed no significant differences between the
sexes in the Bias to Males condition. Individual comparisons for the Bias to
Females condition, however, revealed significant sex differences at each minute.
Discussion 

Generally speaking, the experimenters did bias the performance of the children.
The significant Bias Condition x Sex of Subject interaction in the analysis of the
difference scores reveals differences in the predicted direction, i.e., boys performed
at a higher rate than girls for experimenters biased toward males and girls performed
at a higher rate than boys for experimenters biased toward girls.

The major finding with respect to the predictions was the Grade Level x Bias
Condition x Sex of Subject interaction (Table 8), which revealed clear developmental
trends in susceptibility to experimenter bias effects. At the first-grade level there
were no significant Bias Condition effects for either the male or female Ss, although
the means were in the predicted directions. At the third-grade level the
Bias Condition effect was significant for the males; the mean difference score
was higher if the experimenter was biased toward males than females. For the
females the Bias Condition effect was not significant, although the means were in
the predicted direction. At the fifth-grade level the Bias Condition effect was
significant for both the male and female subjects, with the means in the
predicted direction.

The above findings indicate clear developmental and sex of subject trends
in susceptibility to experimenter-bias effects. Although the exact bases of
these trends are difficult to elaborate at the present time, it may be that as
children become more developmentally mature they are better able to interpret
the subtle cues emitted by the experimenter and tend to comply with the
interpretation placed on the cues. The processes involved may be similar to those examined
by Flavell (1968) in connection with children's role-taking and communication skills.

Experiment III 

Adult Expectancy Effects: Self-generated 
versus Induced Expectancies 

When evaluating adult expectancy effects there is but little evidence
relating to the importance of the manner by which the adult acquires the expectancy
for the to-be-produced outcome. Some of the available evidence (e.g., Bootzin,
1971) suggests that self-generated expectancies relate more to obtained bias
than expectancies induced by the principal investigator. However, there is other
evidence (e.g., Marcia, 1961; Marwit & Marcia, 1967) which does not support this
position. Experiment III was aimed at assessing the importance of mode of
development of adult expectancies vis-a-vis the effectiveness of adults in biasing the
simple motor performance of children.




Subjects

The subjects were 48 kindergarten children (CA = 5 yrs. 11 mo., SD = 8 mo.),
half males and half females. The children attended a school serving primarily
a lower-class neighborhood.
Experimenters

The experimenters were 12 male college students (CA = 19 yrs. 9 mo., SD =
13 mo.) enrolled in the introductory psychology course at Syracuse University.
During the experiment each E tested two boys and two girls. Each was randomly
assigned to one bias condition and one induction condition.
Apparatus

The apparatus was identical to that used in Experiment II.
Procedure 

Experimenter training. With the exception of the group training session,
the experimenter training was essentially identical to that of Experiment II.
Each experimenter was trained individually. Experimenters assigned to the Induced
Bias Condition were given the same statement as was given in Experiment II.
Experimenters in the Self-generated Bias Condition were asked to predict whether boys or
girls would drop the marbles faster. Each experimenter practiced the task
administration procedures with a graduate student.

Experimental task. The procedures for the experimental task were identical
to those employed in Experiment II.
Design 

The design of Experiment III was a 2 (Induction Condition) x 2 (Bias Condition)
x 2 (Sex of Subject) x 7 (Minutes) factorial design with six subjects in the
smallest cell.
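The cell size and the error degrees of freedom used later in the Results follow from this design. The sketch below enumerates the between-subjects cells and derives the error df. It assumes the 48 children were divided equally among cells and that the repeated-measures analysis covers the six difference-score minutes rather than all seven task minutes. Variable names are illustrative.

```python
from itertools import product

# Between-subjects factors of Experiment III
induction_conditions = ["induced", "self-generated"]
bias_conditions = ["to males", "to females"]
sexes = ["male", "female"]

n_subjects = 48
n_difference_minutes = 6  # assumed: difference scores exist for six minutes

# Every combination of factor levels is one cell of the design
cells = list(product(induction_conditions, bias_conditions, sexes))
subjects_per_cell = n_subjects // len(cells)   # assumes equal cell sizes

# Error degrees of freedom for the mixed (between x within) analysis
df_error_between = n_subjects - len(cells)
df_error_within = df_error_between * (n_difference_minutes - 1)

print(len(cells), subjects_per_cell, df_error_between, df_error_within)
```

Under these assumptions the design has 8 cells of 6 children, with 40 and 200 error degrees of freedom, which agrees with the df = 1/40 and df = 1/200 tests reported for this experiment.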






Results 

Dependent measures. As in Experiment II, the dependent measures were the
base-rate of response and six difference scores, one for each minute of the
experimental period of the task. The correlation between the base-rate and the average
difference score was -.62 (N = 48, p < .01), indicating that approximately 38% of
the variance in the difference scores is accounted for by the initial base-rate
levels.
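The 38% figure is the squared correlation. The short check below also computes the conventional t statistic for testing a correlation against zero; that test is a standard textbook formula and is included only as an illustration, since the text does not say how the significance level was obtained.

```python
import math

r = -0.62   # correlation between base-rate and average difference score
n = 48      # number of children

# Proportion of variance in difference scores shared with base-rate
variance_explained = r ** 2

# t statistic for H0: rho = 0, with n - 2 degrees of freedom
# (standard formula; not taken from the paper)
t_stat = r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)

print(f"variance explained: {variance_explained:.1%}")
print(f"t({n - 2}) = {t_stat:.2f}")
```

The negative sign of the correlation indicates that children with higher base-rates tended to show smaller gains in rate during the experimental period.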

Analysis of base-rate scores. The base-rate scores were subjected to a
2 (Induction Condition) x 2 (Bias Condition) x 2 (Sex of Subject) analysis of
variance (see Table 10). The mean base-rate scores for each main effect are
presented in Table 11. As may be seen in Table 10, the only significant effect
was the triple interaction involving Induction Condition, Bias Condition, and Sex
of Subject. The means for this effect are presented in Table 12. Individual
comparisons revealed the following: a) the only significant sex difference
(F = 7.08, df = 1/40, p < .05) was in the Self-Generated Bias to Females Condition;
b) the only significant Bias Condition difference was for the male Ss in the
Self-Generated Induction Condition.
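Each F in Table 10 is the effect mean square divided by the error mean square (18.838 on 40 df). The sketch below recomputes the ratios from the tabled mean squares; the dictionary keys are shorthand for the table's source labels (A = Induction Condition, B = Bias Condition, C = Sex of Subject).

```python
# Sketch: recompute the F ratios of Table 10 as MS_effect / MS_error.

MS_ERROR = 18.838  # error mean square, df = 40

mean_squares = {
    "A": 17.516,
    "B": 1.688,
    "C": 17.520,
    "A x B": 46.023,
    "A x C": 6.023,
    "B x C": 50.020,
    "A x B x C": 88.023,
}

f_ratios = {source: ms / MS_ERROR for source, ms in mean_squares.items()}

for source, f_value in f_ratios.items():
    # Ratios below 1 are conventionally reported only as "<1"
    label = "<1" if f_value < 1 else f"{f_value:.2f}"
    print(f"{source:<10} F = {label}")
```

The recomputed ratios agree with the tabled values (2.44, 2.66, and 4.67 for the three entries above 1), and the triple interaction is the largest of them, consistent with its being the only effect reported as significant.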

Analysis of Difference Scores. The difference scores were subjected to a
2 (Induction Condition) x 2 (Bias Condition) x 2 (Sex of Subject) x 6 (Minutes)
analysis of variance with repeated measures on the last factor (see Table 13). The
means for the between-subjects main effects are presented in Table 11. The only
significant between-subjects effect was the Bias Condition x Sex of Subject
interaction (see Table 14). Individual comparisons (Winer, 1962, p. 344) revealed
significant Sex of Subject effects for Bias to Males (F = 6.48, df = 1/40, p < .05)
and Bias to Females (F = 22.90, df = 1/40, p < .001) and significant Bias Condition
effects for male Ss (F = 10.36, df = 1/40, p < .01) and female Ss (F = 16.92, df = 1/40,
p < .001).

There were several significant within-subjects effects. The significant
Minutes main effect (see Table 15) reflected a general decrease in rate of
response during the experimental period of the task. The Induction Condition x
Minutes interaction was significant (see Table 15) and reflected a general decrease
in rate of response for Ss in the Induced condition and, generally, an increase and
then decrease in rate of response for Ss in the Self-Generated condition. The
Induction Condition x Bias Condition x Minutes interaction was also significant
(see Figure 2). Individual comparisons revealed that for the Induced Condition
there were significant Bias Condition effects for minutes 4 (F = 6.50, df = 1/200,
p < .05), 5 (F = 6.03, df = 1/200, p < .05), 6 (F = 6.36, df = 1/200, p < .05), and
7 (F = 6.56, df = 1/200, p < .05), but for the Self-Generated Condition the only
significant Bias Condition effect was for minute 4 (F = 4.27, df = 1/200, p < .05).
Discussion

The major focus of Experiment III was to investigate the effects of mode
of inducing expectations in the experimenter in the obtaining of experimenter-bias
effects with children. Although the analysis of the difference scores
revealed a significant Bias Condition x Sex of Subject interaction, indicating
a significant experimenter-bias effect, this effect did not interact with
Induction Condition. Induction Condition did interact with Minutes, and with Bias
Condition and Minutes. However, these effects are not readily interpretable given
current theorizing in the area. The data would appear to support the findings and
theorizing of Marcia (1961) and Marwit and Marcia (1967), indicating no significant
differences due to Induction Condition.






Table 10

Analysis of Variance of Base-Rate Scores

Source                       df         MS         F

Induction Condition (A)       1       17.516      <1
Bias Condition (B)            1        1.688      <1
Sex of Subject (C)            1       17.520      <1
A x B                         1       46.023      2.44
A x C                         1        6.023      <1
B x C                         1       50.020      2.66
A x B x C                     1       88.023      4.67
error                        40       18.838








Table 11

Mean Base-Rate and Mean Difference Score
for Each Main Effect

Effect                   Mean           Mean
                         Base-Rate      Difference Score

Induction Condition
   Induced               17.92             .75
   Self-generated        16.71             .56

Bias Condition
   To Males              17.50             .43
   To Females            17.12             .88

Sex of Subject
   Males                 17.92             .10
   Females               16.71            1.20



Table 12

Mean Base-Rates for the Induction Condition x Bias Condition
x Sex of Subject Interaction

Induction          Bias              Sex of Subject
Condition          Condition         Male      Female

Induced            To Males          19.67     18.50
                   To Females        16.67     16.83

Self-Generated     To Males          14.50     17.33
                   To Females        20.83     14.17



Table 13

Analysis of Variance of Difference Scores

Source                       df         MS             F            p

Induction Condition (A)       1     [illegible]   [illegible]
Bias Condition (B)            1     [illegible]   [illegible]
Sex of Subject (C)            1     [illegible]   [illegible]
A x B                         1     [illegible]   [illegible]
A x C                         1     [illegible]   [illegible]
B x C                         1      946.125        26.675        <.001
A x B x C                     1        5.014        <1
error                        40       35.469

Minutes (D)                   5       28.431         6.60         <.001
A x D                         5       10.631         2.47         <.05
B x D                         5         .631        <1
C x D                         5        6.255         1.45
A x B x D                     5        9.797         2.28         <.06
A x C x D                     5         .556        <1
B x C x D                     5        3.867        <1
A x B x C x D                 5        4.256        <1
error                       200        4.306










Table 14

Mean Difference Scores for Each Bias
Condition x Sex of Subject Subgroup

Bias Condition        Sex of Subject
                      Male      Female

To Males              1.69       -.83
To Females           -1.49       3.24



Table 15

Mean Difference Scores for the Minutes x
Induction Condition Interaction

Induction                          Minutes
Condition           2       3       4       5       6       7

Induced            2.04    1.42     .25     .79     .17    -.17
Self-generated      .88     .92    1.71     .79     .17   -1.12

Mean               1.46    1.17     .98     .79     .17    -.64




General conclusions from Experiments II & III:

Recall that in the study of teacher-bias and teacher-expectancy effects
there were no Grade Level x Bias Condition interactions, indicating no
developmental trends in teacher-bias or teacher-expectancy effects. The
experimental study aimed at assessing developmental trends in experimenter-bias,
however, did reveal a significant Grade Level x Bias Condition x Sex
of Subject interaction. Clear developmental trends were discernible within
this interaction. Older children evidenced a greater susceptibility to
experimenter-bias effects than younger children. This divergence in findings
is most likely due to the "group" nature of the teacher-bias experiment and the
opportunity for single-subject interaction in the experimenter-bias study. That
is, it may be more likely for developmental trends in susceptibility to adult
influence to be evidenced in a one-to-one situation as opposed to a one-to-group
situation.

Although the study of teacher-bias and teacher-expectancy effects revealed
that teacher-expectancy but not teacher-bias effects related to SAT performance,
the findings of Experiment III were inconclusive with respect to differential
effects of experimenter-bias and experimenter-expectancy in relation to simple motor
performance. Again, it may be that bias and expectancy effects exert differential
influences on performance depending upon the type of situational interaction, i.e.,
group or individual.

An alternative explanation for the divergence of findings must also be
considered. It may be the case that, on the one hand, cognitive performance,
such as is measured by achievement tests, is not susceptible to bias effects.
On the other hand, motor performance may be influenced by such effects. Obviously,
this is a question which must be answered by future research.






References

Anderson, D. F., & Rosenthal, R. Some effects of interpersonal expectancy
and social interaction on institutionalized retarded children. Proceedings
of the 76th Annual Convention, APA, 1968, Pp. 479-480.

Barber, X., & Silver, M. Fact, fiction, and the experimenter bias effect.
Psychological Bulletin Monograph Supplement, 1968, 70, 1-29. (a)

Barber, X., & Silver, M. Pitfalls in data analysis and interpretation:
A reply to Rosenthal. Psychological Bulletin Monograph Supplement,
1968, 70, 48-62. (b)

Beez, W. V. Influence of biased psychological reports on teacher behavior
and pupil performance. Proceedings of the 76th Annual Convention, APA,
1968, Pp. 605-606.

Bootzin, R. Induced and stated expectancy in experimenter bias. Proceedings of
the 76th Annual Convention, APA, 1969, 4(1), 365-366.

Claiborn, W. L. Expectancy effects in the classroom: A failure to replicate.
Journal of Educational Psychology, 1969, 60, 377-383.

Cohen, J. Multiple regression as a general data-analytic system. Psychological
Bulletin, 1968, 70, 426-443.

Dusek, J. B. Experimenter-bias effects on the simple motor task performance
of low- and high-test anxious boys and girls. Psychological Reports,
1972, 30, 107-114.

Dusek, J. B., & O'Connell, E. J. Teacher expectancy effects on the achievement
test performance of elementary school children. Journal of Educational
Psychology, 1973, in press.

Elashoff, J. D., & Snow, R. E. A case study in statistical inference:
Reconsideration of the Rosenthal-Jacobson data on teacher expectancy.
Technical report #15, Stanford Center for Research and Development in
Teaching, School of Education, Stanford University, December, 1970.

Elashoff, J. D., & Snow, R. E. (Eds.) Pygmalion reconsidered. Worthington,
Ohio: Charles A. Jones, 1971.

Evans, J., & Rosenthal, R. Interpersonal self-fulfilling prophecies: Further
extrapolations from the laboratory to the classroom. Proceedings of the APA
Convention, 1969, 371-372.

Flanagan, J. C. Test of general ability: Technical report. Chicago: Science
Research Associates, 1960.






Flavell, J. The development of role-taking and communication skills in children.
New York: John Wiley & Sons, Inc., 1968.

Fleming, E. S., & Anttonen, R. G. Teacher expectancy as related to the academic
and personal growth of primary-age children. Monographs of the Society for
Research in Child Development, 1971, 36(5), Serial No. 145.

Friedman, N. The social nature of psychological research: The psychological
experiment as a social interaction. New York: Basic Books, 1967.

Jensen, A. R. How much can we boost IQ and scholastic achievement? Harvard
Educational Review, 1969, 39, 1-123.

Jose, J., & Cody, J. Teacher-pupil interaction as it relates to attempted
changes in teacher expectancy of academic ability and achievement. American
Educational Research Journal, 1971, 8, 39-49.

Kirk, R. Experimental design: Procedures for the behavioral sciences. Belmont,
Calif.: Cole Publishing Co., 1968.

Marcia, J. E. The need for social approval, the condition of hypothesis-making,
and their effects on unconscious experimenter bias. Unpublished master's
thesis, Ohio State University, 1961.

Marwit, S. J., & Marcia, J. E. Tester bias and response to projective instruments.
Journal of Consulting Psychology, 1967, 31, 253-258.

Meichenbaum, D., Bowers, K., & Ross, R. A behavioral analysis of teacher expectancy
effect. Journal of Personality and Social Psychology, 1969, 13, 306-316.

Overall, J. E. Multiple covariance analysis by the general least squares regression
method. Behavioral Science, 1972, 17, 313-320.

Overall, J. E., & Spiegel, D. K. Concerning least squares analysis of experimental
data. Psychological Bulletin, 1969, 72, 311-322.

Rosenthal, R. Experimenter effects in behavioral research. New York:
Appleton-Century-Crofts, 1966.

Rosenthal, R. Experimenter expectancy and the reassuring nature of the null
hypothesis decision procedure. Psychological Bulletin Monograph Supplement,
1968, 70, 30-47.

Rosenthal, R. Empirical vs. decreed validation of clocks and tests. American
Educational Research Journal, 1969, 6, 689-691. (a)

Rosenthal, R. Interpersonal expectations: Effects of the experimenter's hypothesis.
In R. Rosenthal and R. Rosnow (Eds.), Artifact in behavioral research. New York:
Academic Press, 1969. Pp. 182-277. (b)






Rosenthal, R. On the social psychology of the self-fulfilling prophecy:
Further evidence for Pygmalion effects and their mediating mechanisms.
Unpublished manuscript, 1973, Harvard University.

Rosenthal, R., & Jacobson, L. Pygmalion in the classroom. New York: Holt, 1968.

Rosenthal, R., & Rubin, D. B. Pygmalion reaffirmed. In J. D. Elashoff and R. E.
Snow (Eds.), Pygmalion reconsidered. Worthington, Ohio: Charles A. Jones,
1971. Pp. 139-155.

Rubovits, P., & Maehr, M. Pygmalion analyzed: Toward an explanation of the
Rosenthal-Jacobson findings. Journal of Personality and Social Psychology,
1971, 19, 197-203.

Snow, R. Unfinished Pygmalion. Contemporary Psychology, 1969, 14, 197-199.

Stevenson, H. W., & Fahel, L. The effect of social reinforcement on the performance
of institutionalized and noninstitutionalized normal and feebleminded children.
Journal of Personality, 1961, 29, 136-147.

Thorndike, R. L. Review of Pygmalion in the classroom. American Educational
Research Journal, 1968, 5, 703-711.

Thorndike, R. L. You have to know how to tell time. American Educational
Research Journal, 1969, 6, 692.

Winer, B. J. Statistical principles in experimental design. New York:
McGraw-Hill, 1962.






Footnotes

1. The project presented or reported herein was performed pursuant to
   a grant (No. OEG-2-71-0516) from the U.S. Office of Education,
   Department of Health, Education, and Welfare. The opinions expressed
   herein, however, do not necessarily reflect the position or policy
   of the U.S. Office of Education, and no official endorsement by the
   U.S. Office of Education should be inferred.

2. The author is grateful to Mr. James McGee, Principal, and the teachers
   of the kindergarten through fifth grades of Clinton Elementary School
   for their excellent cooperation and enthusiasm.

3. Analyses of SAT-1 scores revealed no differences between the children
   remaining available at the end of the first year or middle of the
   second year and those lost throughout the experiment.




